Using speech rhythm for acoustic language identification
نویسندگان
چکیده
This paper presents results on using rhythm for automatic language identification (LID). The idea is to explore the duration of pseudo-syllables as language discriminative feature. The resulting Rhythm system is based on Bigram duration models of neighbouring pseudo-syllables. The Rhythm system is fused with a Spectral system realized by parallel Phoneme Recognition (PPR) approach using MFCC’s. The LID systems were evaluated on a 7 languages identification task using the SpeechDat II databases. Tests were performed with 7 seconds utterances. Whereas the Spectral system acting as a baseline system achieved an error rate of 7.9 % the fused system reduced the error rate by 10 % relatively.
منابع مشابه
Rhythmic unit extraction and modelling for automatic language identification
This paper deals with an approach to automatic language identification based on rhythmic modelling. Beside phonetics and phonotactics, rhythm is actually one of the most promising features to be considered for language identification, even if its extraction and modelling are not a straightforward issue. Actually, one of the main problems to address is what to model. In this paper, an algorithm ...
متن کاملDeep Neural Network Bottleneck Features for Acoustic Event Recognition
Bottleneck features have been shown to be effective in improving the accuracy of speaker recognition, language identification and automatic speech recognition. However, few works have focused on bottleneck features for acoustic event recognition. This paper proposes a novel acoustic event recognition framework using bottleneck features derived from a Deep Neural Network (DNN). In addition to co...
متن کاملLanguage identification with suprasegmental cues: a study based on speech resynthesis.
This paper proposes a new experimental paradigm to explore the discriminability of languages, a question which is crucial to the child born in a bilingual environment. This paradigm employs the speech resynthesis technique, enabling the experimenter to preserve or degrade acoustic cues such as phonotactics, syllabic rhythm, or intonation from natural utterances. English and Japanese sentences w...
متن کاملThe Relationship Between Acoustic Characteristics and Personality Dimensions in Patients With Dysphonia
Objectives: Voice is influenced by personality. However, it is still questionable which acoustic features are influenced by personality traits. This study aimed to investigate the relationship between acoustic characteristics and personality dimensions. Methods: Thirty-three participants with dysphonia and 33 participants without dysphonia were recruited to take part in this cross-sectional st...
متن کاملValidating Acoustic Measures of Speech Rhythm for Second Language Acquisition
This paper reports research investigating the validity of using Pairwise Variability Indexes in research into the second language acquisition of speech rhythm. Findings determined that 1) expert native-speakers rate non-native speaker rhythm based on a common factor, and 2) part of that common factor can be accounted for by the use of vocalic pairwise variability. It was concluded that the PVI ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007